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1. INTRODUCTION 

The determination of the closest pair of points (CPP) in a two-dimensional plane is one of the basic 
problems of computational geometry. Similarity matching, pattern recognition, computer vision and position 
estimation are some of the applications of CPP [1] and [2]. One of the typical approaches to solving the 
problem on the closest pair of points is the brute force method. In this method, the computation of distances 
of all pairs of points is necessary to find out the closest pair. To compute for the total number of comparisons 
of brute force method, equation (1) was used. 


(n-1)n 


T= () 
where: 

TC - The number of comparisons 

n — - Total number of distance to be compared. 


The divide-and-conquer algorithm was used to improve the brute force method. The study of [3] 
used this approach to solve the closest pair problem in a 2-dimensional plane. It utilized the concept 
introduced by [4] on the closest pair of points problem and modified the divide-and-conquer algorithm by 
computing at least seven pairwise comparisons in the central slab. Another enhancement was presented by 
[5] wherein an improved version of the divide-and-conquer algorithm on the planar set used at most four 
pairwise comparisons in the combining step for each point in the central slab. In 2006, the study of [6] 
showed that the divide-and-conquer algorithm pairwise comparison is possible to be reduced in the 
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combining step in the central slab for their enhanced version. Later, the study of [7] presented another 
enhancement of divide-and-conquer algorithm called the Basic-2 where two pairwise comparisons were used 
in the combining step for each point in the central lab. 

In 2015, the Pair-Pruned_CP algorithm of [8] was introduced. It is a simple and efficient algorithm 
to solve the closest pair of points problem. The algorithm functionalities focused on the computation of the 
distances of the pair of points which may be closer than the initial closest pair of points. It is a comparison- 
based deterministic approach to solve the CPP problem. 

Dynamic warping (DW) is a dynamic programming technique used to match the sequence for a 
specific application. Some of the applications of DW include the undirected graph model for calibrating and 
storing sequence-type radio-map which reduced the disparities of RSS. The study of [9] showed an increase 
in accuracy of 20% using dynamic warping. The interesting feature of dynamic warping is its heuristic 
approach to pattern searching. The warpmap of [9] is an accurate and efficient indoor localization in 
sequence type radio map. The key feature of this study was the capability of its system of accurate and 
efficient position estimation. 

Another CPP algorithm is the study of [10] that makes use of an algorithm to solve the closest pair 
problem with the aid of the tree model. It was also proven by the study of [11] that with the use of a divide- 
and-conquer approach, the closest pair of point problem can be initially solved in much higher dimensions 
for planar case. The divide-and-conquer algorithm makes it possible to lessen the number of comparisons 
which contribute to the efficiency of the approach. The study of [12] the divisions into several numbers of 
groups were considered in their experiment that decreases the number of comparisons. It was also proven by 
[13] and [14] used plane-sweep and optimal algorithm to solve the closest pair of point’s problem. 

The theory behind [15, 16] on the closest pair of points problems solutions had considered the 
application of several algorithms to reduce the expected running time. These algorithms are the same 
algorithm used for the locality sensitive hashing [17], light bulb problem [18] and coin flip of Rabin [19]. 

One of the similar closest pairs of point's problem is the time series motif mining. It is an application 
of CPP in large dimensional space that needs to have an efficient and accurate result. The study of [20] 
presented the first exact motif search algorithm for time series motif mining that outperformed the brute- 
force algorithm. This algorithm became the best performing algorithm in practice for time series motif 
mining. The same problem was also explored by several studies that used probabilistic and approximate 
algorithms for solving time series motif mining. The study of [21] applied it to the motion-motif graph. The 
study of [22] enhanced the time series motifs using the probabilistic algorithm. The study of [23] explored 
the possibility of application of time series in knowledge construction using a collaborative exploration 
system. The paper of [24] applied it to human motion and [25] have tried to find out that multivariate Motifs 
using subsequence density estimation and greedy mixture learning is possible and useful for time series motif 
mining reference. Both of the papers of [21] and [22] are concerned on the application of time series motif 
mining principle to machine learning wherein a large scale time series database are processed. 

Taking an account on its functionalities on the number of comparisons, the brute force method has 
the most number of comparisons of the distances while the divide-and-conquer algorithm has reduced such. 
In the case of coordinate-based similarity determination, the closer the reference point to the set of points, the 
more similar the reference point. 

Consider the reference point RG, j) and the set of points {A(x1, yi), B(x2, y2), C(x3, y3) and D(x, 
ys)}. The brute force method will find out all the distances of the reference point RG, j) from the five points 
{R, A, B, C, D} which accumulates 10 distance computations {RA, RB, RC, RD, AB, AC, AD, BC, BD, 
CD} and determine the minimum distance to find out the corresponding closest point from the reference 
point. The divide-and-conquer algorithm will find out the number of possible divisions and compute for the 
distances of each of the division of points. As a result, the number of comparisons between the distances 
is reduced. 

In this study, a new approach to solve the CPP problems is presented where the dynamic warping is 
applied to minimize the number of comparisons of distances between the reference point R(i, j) and the set of 
points P(Xn, yn). The computed distances are presented in coordinates-map to be used for similarity 
determination. The numbers of comparisons of the proposed algorithm are compared with brute force and 
divide-and-conquer methods of CPP. Future applications may consider the use of radio frequency 
identification and visible light communication for location estimation as studied by [26]. 


2. PROPOSED ALGORITHM 

The proposed modification of the brute force method of solving the CPP problem is presented by 
introducing the coordinates-map grid. A coordinates-map grid with 16 sample cells is illustrated in Figure 1. 
Figure 2 shows the coordinates-map grid. Each cell of the coordinates-map grid is the container of the 
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distance between the reference point R(i, j) and the set of points assigned to a particular cell. For example, 
the D11 cell contains the distance between the reference point R(i, j) and the coordinates P(1, 1) and D12 cell 
contains the distance between the reference point R(i, j) and the coordinates P(1, 2). 

















D14 D24 D34 D44 
D13 D23 D33 D43 
D12 D22 D32 D42 
Dil D21 D31 D41 

















Figure 1. Coordinates-map Grid 














0.6 1.2 2.1 3.1 
0.4 1.1 2.0 3.0 
1.4 1.7 2.4 3.3 
2.4 2.6 3.1 3.8 




















Figure 2. Sample Distance Output of Coordinates-map Grid 


2.1. Algorithm for Distance Computation and Storing of Distance in the Coordinates-Map Grid 
Input =: Reference Point {R (i, j)} and the set of points {(1, 1),..(1, 4), (2, 1)..(2, 4)...(4,1..(4, 4) } 
Output : Distances between the reference point (R) and the set of points (P) in the coordinate-map grid 


Loop0: 
For x =1 to 4 
Loop1: 
For y= 1 to4 
D,y = distance between R(i, j) and P (x, y) 
inc y 
Store D,, in the coordinates-map grid 
if x<=4 then 
Goto loop 1 
Else 
x=1 
inc y 
Goto loop0 
Endif 
End for 
End for 
Sample output: 


Input __: Reference Point {R(1, 3.4)} and the set of points {(1, 1),..(1, 4), (2, 1)..@, 4)...44,D..(4 4 } 
Output : Distances between the reference point (R) and the set of points (P) in the coordinate-map grid 


2.2. Schematic Steps that Determine the Minimum Distance and the Set of Point P(X, Y) Closer to 
Reference Point R(I, J) 
Input —_: Distances between the reference point R(i, j) and P(x, y) in the coordinates-map grid. 
Output : Closest point P(x, y). 
1. Using the coordinates-map grid of Figure 1, start from the upper right corner of the coordinates- 
map grid using the schematic step presented in Figure 3. 


D34 "7 | Start 


D33 D43 


























Figure 3. The schematic step 
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2. Follow the schematic step of Figure 3. 





D14 D24 D34| D44 
D13 D23 D3 








D12 D22 D32 D42 
D11 D21 D31 D41 














Figure 4. Illustration of the schematic step to the coordinates-map grid 


3. Determine the minimum value of the four cells (D44, D34, D33, and D43). 
4. Move to the next step by considering the minimum distance value as the “Start” of the next 
schematic step. (Assumption: D34 is the minimum value). 


| bial _24| 034] aa 
D2. 


D13 D33 D43 
D12 D22 D32 D42 


Figure 5. Illustration of the next Schematic Step 








5. If the minimum distance value did not change anymore, the corresponding coordinates of the 
minimum distance are the closest coordinates of the reference point R (i, j). 


Sample output: 

Input —_: Distances between the reference point R(1, 3.4) and P(x, y) displayed in the coordinates-map grid. 
Output : Closest point P(x, y). 

Steps: 


PNW PWM 
PNW PWM 


PNW PWM 
PNW PW 





Figure 8. Third Step Figure 9. Summary of Steps Process 


Output: P(x, y) = P (1, 3). Therefore, the reference points R (1, 3.4) and P (1, 3) are the closest pairs of 
points. 
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3. RESULTS AND ANALYSIS 

The proposed algorithm was executed with the use of the Java programming language. Point R (1.3, 
2.2) was used as the reference point (R) and 100 points {(1, 1),..(1, 10), (2, 1)..(2, 10)...(10,1)..(10, 10) 
samples were used as the set of points (P). These reference points (R) and the sample set of points (P) were 
used to demonstrate how the proposed algorithm works. 


3.1. Computation of Distance 
Input _: Reference Point {R(1.3, 2.2)} and the set of 100 points {(1, 1)..(10, 10)} 
Output : Distances between {R(1.3, 2.2)} and the set of 100 points 

Table 1 shows the summary of computed distances of coordinates-map. A 10x10 coordinates-map 
grid was used to present the computed distances between the reference point and the set of points. Figure 11 
shows that the minimum distance is the distance located at P(1, 2) which is equivalent to 0.4. Hence, the 
closest pair of points is R(1.3, 2.2) and P(1, 2). 


Table 1. Summary of Computed Distance of Coordinate-Map 





78 78 8.0 8.3 8.6 9.1 9.7 10.3 11.0 11.7 
6.8 6.8 7.0 13 7.7 8.3 8.9 9.5 10.3 11.0 
5.8 5.8 6.0 6.4 6.9 LS 8.1 8.9 9.6 10.5 
4.8 4.9 5.1 5.5 6.1 6.7 ves) 8.2 9.1 9.9 
3.8 3.9 4.2 4.7 3:3 6.0 6.9 77 8.6 9.5 
2.8 2.9 3.3 3.9 4.6 mee] 6.4 13 8.2 9.1 
1.8 1.9 2.5 3.2 4.1 5.0 6.0 6.9 7.9 8.9 
0.9 1.1 1.9 2.8 3.8 4.8 5.8 6.7 77 8.7 
0.4 0.7 iL 2] 3.7 47 5.7 6.7 17 8.7 
1.2 1.4 2.1 3.0 3.9 4.9 5.8 6.8 78 8.8 





3.2. Determination of Closest Point to the Reference Point 
Input : Distances between the reference point R(1.3, 2.2) and 100 set of points. 
Output : Closest point P(x, y). 












































Figure 11. Summary of Execution Process during the Determination of Closest Point 
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3.3. Derivation of the Number of Comparisons of the Proposed CPP Algorithm 

The brute force method compares all the distances between each pair of points while the divide-and- 
conquer algorithms divide the set of distances into two and compare all the distances between each pair of 
points of each division. Applying equation (1) of the brute force method and divide-and-conquer for 100 
distances to be compared resulted in TC = 4950 and TC = 2450, respectively. In the proposed CPP 
algorithm, the number of comparisons only depends on the number of schematic steps. The equation (2) is 
derived to compute for the number of comparisons: 


(n-1)n 


Ku = Ae) (2) 
where: 
Ku_ - The number of comparison of the proposed CPP algorithm 
A - Number of schematic steps 
n — - Total number of distances to be compared 


Applying the equation (2), the number of comparisons of the proposed CPP algorithm is Ku = 60 
with n = 4 and the worst case value of A = 10 for 100 distances value. 


3.4. Data on the Comparisons of other Number of Distances 

Table 2 shows the presentation of the number of comparisons between the brute force, divide-and- 
conquer algorithm and the proposed CPP algorithm using 100, 10,000 and 1,000,000 distances, respectively. 
To determine the number of comparisons of the distances between the reference point and set of points for 
brute force and divide-and-conquer algorithm, the equation (1) was used. Moreover, to determine the number 
of comparisons of the distances between the reference point and set of points for the proposed CPP 
algorithm, the equation (2) was used. In addition, the proposed CPP algorithm used 10x10, 100x100 and 
1000x1000 coordinates-map grid and considered the worst case value of schematic steps (A) of 10, 100 and 
1000 respectively. 


Table 2. Number of Comparisons of Brute Force Method and the Proposed CPP 





, Brute Force Method Divide-and-conquer Proposed CPP algorithm 
Number of Distances : : 
fs he Compared (Number of Algorithm (Number of Comparisons) 
P Comparisons) (Number of Comparisons) With A=10, 100, 1000 respectively 
100 4,950 2,450 60 
10,000 49,995,000 24,995,000 600 
1,000,000.00 4.99 x10! 2.44 x10! 6,000 





4. CONCLUSION 

The modified brute force algorithm is the proposed algorithm for solving the CPP problem. The 
proposed algorithm is simple and easy to follow. The experiment results showed that the number of 
comparisons of the proposed CPP algorithm is less than the brute force method and the divide-and-conquer 
algorithm. Moreover, the proposed CPP algorithm is efficient in terms of the number of comparisons as 
proven by the experiments of different sets of computed distances. The proposed algorithm may be 
implemented on coordinate based applications like position estimation and similarity matching to further test 
its functionalities. 
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